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Abstract. We present an approach to propagation based solving, Boolean 
equi-propagation, where constraints are modelled as propagators of in- 
formation about equalities between Boolean literals. Propagation based 
solving applies this information as a form of partial evaluation resulting 
in optimized SAT encodings. We demonstrate for a variety of bench- 
marks that our approach results in smaller CNF encodings and leads to 
speed-ups in solving times. 



1 Introduction 

In recent years, Boolean SAT solving techniques have improved dramatically. 
Today's SAT solvers are considerably faster and able to manage far larger in- 
stances than yesterday's. Moreover, encoding and modeling techniques are better 
understood and increasingly innovative. SAT is currently applied to solve a wide 
variety of hard and practical combinatorial problems, often outperforming ded- 
icated algorithms. The general idea is to encode a (typically, NP) hard problem 
instance, P, to a Boolean formula, epp, such that the solutions of P correspond 
to the satisfying assignments of <pp. Given an encoding from problem instances 
to Boolean formula, a SAT solver is then applied to solve the problem instances. 

Tailgating the success of SAT technology are a variety of tools which can 
be applied to specify and then compile problem instances to corresponding SAT 
instances. Typically, a constraint based modelling language is introduced and 
used to model instances. Then encoding techniques are applied to compile con- 
straints to the language of an underlying solver such as SAT, SMT, or others. 
Some examples follow: In [5], Cadoli and Schaerf introduce a logic-based spec- 
ification language, called NP-SPEC, which allows one to specify combinatorial 
problems in a declarative way. At the core of their system is a compiler which 
translates specifications to CNF formula. Sugar [19], is a SAT-based constraint 
solver. To solve a finite domain linear constraint satisfaction problem it is first 
encoded to a CNF formula by Sugar, and then solved using the MiniSat solver [8]. 
MiniZinc [14], is a constraint modeling language which is compiled by a variety 
of solvers to the low-level target language FlatZinc. For example, f zntini [13] 
solves FlatZinc instances by encoding them as CNF. In [3], the authors present 
f zn2smt, a tool for solving FlatZinc instances by encoding them to SMT-LIB [2]. 



Taking the analogy with programming languages, we want to facilitate the 
process of providing a high-level description of how the (constraint) problem at 
hand is to be solved. Given such a description, a compiler can then provide a, 
low-level executable for the underlying machine. In our context, the low-level 
executable is a CNF formula, and the underlying machine, a SAT solver. One 
advantage in the use of such tools is that the user can easily experiment with 
a variety of modeling options without the need to tediously encode each as a 
CNF formula. Another advantage is that optimizations can be handled in the 
compiler, and not each time from scratch. 

A major obstacle when solving combinatorial problems using SAT technology 
is that CNF encodings do not maintain the context of the bits specified in the 
constraint model. For example, the information that certain bits in the CNF 
encoding originate from a vector representing an integer value in the constraint 
model, is lost. Consequently, our ability to apply optimizations related to the 
original vector diminish. 

This paper takes a new approach, defining the notion of an equi-propagation 
solver. Prior to SAT encoding, constraints are viewed as propagators of infor- 
mation about equalities between Boolean literals and constants. We repeatedly 
extract such equalities and then factor them out of the original constraint prob- 
lem. We demonstrate that this significantly reduces the size of the constraint 
problem and the subsequent SAT solving time. A novel and efficient implementa- 
tion of equi-propagation using binary decision diagrams (BDD's) [4] is described. 
Drawing on the programming languages analogy, we contribute an optimizing 
compiler for SAT encoding. Equi-propagation and partial evaluation facilitate 
optimization of the constraint model. This, fast (polynomial-time) optimization 
phase is followed by the more costly (exponential-time) SAT solving phase. 



2 Overview 



Constraints in our modelling language are viewed as a Boolean functions about 
the underlying bit representation for finite domain integer variables and other 
Boolean variables. We focus on a unary representation, the order encoding, for in- 
teger variables. Consider three (standard) constructs in the modelling language. 

unary„(X,[a,fe]) [|] differ, X 2 ) [§] allDif f QX U . . . , X n }) 

A constraint unary n (X, [a, b]) where < a < b < n specifies a finite domain 
integer variable X = (xi, . . . , x n ), represented in n bits, which takes values 
in the interval [a, 6]. We denote by dom(X) the finite set of values that vari- 
able X can take. Initially, dom(X) = {a,..., b}. When clear from the con- 
text, we drop n from the notation. A constraint, dif f (X%, X 2 ), specifies that 
integer variables (bit vectors) X\ and X2 take different values from their re- 
spective domains. The third construct, allDif f ([Xi, . . . , X m ]), specifies that 
integer variables [X\ , . . . , X m ] all take different values from their respective 
domains. The argument of this constraint is a list of bit vectors. We denote 
dom({Xi,...,X m }) = U{ dom{X l ) | 1 < i < m }. 
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In the order encoding (see e.g. [6,1]), the bit vector representation of inte- 
ger variable X = (x\, . . . ,x n ) constitutes a monotonic decreasing sequence. For 
example, the value 3 in 5 bits is represented as (1,1,1,0,0). The bit Xi (for 
1 < i < n) is interpreted as the statement X > i. Throughout the paper, for a 
bit vector X = (x\, . . . , x n ) representing an integer in the order-encoding, we as- 
sume implicit bits Xq = 1 and = 0, and denote X(i) = Xi for < i < n+1. 
The order encoding is also used in Sugar [19]. 

The Boolean functions corresponding to constraints [T] - 3 are as follows 
(where 1 < a < b < n): 

uaary((a;i, . . . ,x n ), [a, b]) = A (xi—i < Xi) A x a A —*Xb+\ 

i=l n 

d±ff((xi,...,x„),(yi,...,y n )) = \J (xt xor y t ) (1) 

i=i 

allDiff([Xi,...,X m ]) = /\ diffpfi.X,-) 

l<i<j <m 

For constraint c with integer variable arguments, we denote by c u the conjunction 
of c with the statement that its arguments are represented in the order-encoding. 
For example, dif f U (X, Y) = dif f (X, Y) A unary„(A, [0, n]) A unaryjY, [0, n}). 

An important property of a Boolean representation for finite domain integers 
is the ability to represent changes in the set of values a variable can take. It 
is well-known that the order-encoding facilitates the propagation of bounds. 
Consider an integer variable X = (xi, . . . , x n ) with values in the interval [0, n]. 
To restrict X to take values in the range [a, b] (for 1 < a < b < n), it is sufficient 
to assign x a = 1 and a^+i = (if b < n). The variables x a > for > a' > a and 
b < b' < n are then determined true and false, respectively, by unit propagation. 
For example, given X = {x\, . . . ,xg), assigning xs = 1 and xq — propagates 
to give X = (1, 1, 1, X4, X5, 0, 0, 0, 0), signifying that dom(X) C {3, . . . , 5}. 

A lesser known property of the order-encoding is its ability to specify that a 
variable cannot take a specific value < v < n in its domain by equating two 
variables: x v = x v +\. This indicates that the order-encoding is well-suited not 
only to propagate lower and upper bounds, but also to represent integer vari- 
ables with an arbitrary, finite set, domain. For example, for X = (xi, . . . ,xg), 
equating X2 = £3 imposes that X ^ 2. Likewise 25 = xq and .T7 = xg im- 
pose that X 7^ 5 and X ^ 7. Applying these equalities to X gives, X = 
(xi, X2, X2,X4, X5, 25, xt, xt, Xg) , signifying that dom(X) = {0, 1, 3, 4, 6, 8, 9}. 

The idea in this paper is to simplify constraints, prior to their encoding to 
CNF, using a technique we call equi-propagation. We distinguish between low- 
level constraints, such as unary(X, [a, b]) and diff(Ai, X2), which are about a 
fixed number (one and two) of integer variables, and high-level constraints, such 
as allDif f ([Ai, . . . , X m ]). Low- level constraints are simplified and then encoded 
directly to CNF, while high-level constraints are simplified and then decomposed 
to low-level constraints. We consider three types of simplification rules. To illus- 
trate these, consider the constraint diff(A, Y) where X = (x\, X2, 23, 24) and 
Y = (2/1,2/2,2/3,2/4) are unary variables in the order-encoding. 
(1) equi-propagation, where we propagate information about equalities between 
Boolean literals and constants. For example, given equalities s.t. Y = (1, 1,0,0} 
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we propagate that (x2 = £3) because dif f u ((xi, x 2 , X3, X4), (1, 1, 0, 0)) |= (x% = 
X3). (X is in the order-encoding, so X2 > X3, and X2 < £3 as otherwise 22 = 1 
and X3 = which implies that X = (1, 1, 0, 0), contradicting dif f (X, Y)). When 
we detect such equalities, we apply them to simplify the constraints in a model. 

(2) redundant constraint elimination, where we discover that, due to equalities, 
a constraint is redundant. For example, when Y = (1,1,0,0) and X2 = X3, 
the constraint d±ff(X, Y) is redundant because unaxy((a;i, x%, X3, X4), [0,4]) |= 
diff u ((xi,X 2 ,2C3,»4)i (1, 1,0,0)). 

(3) constraint restriction, where we discover that some bits in a constraint c are 
"dont-cares" and project c to the remaining variables. For example, when xi=l 
and X2=l then y\ is a don't care and diff(X,Y) is equivalent to &iff(X',Y') 
where X' = (x2, X3, X4) and Y 1 = (j/2j V3, Hi}- To see why unary(X, [0, 4]) A 
unary(Y, [0, 4]) A x x = 1 A x 2 = 1 |= dif f {X, Y) O dif f {X', Y'), consider that 
if yi = then also y 2 = and both constraints are true, and if y\ = 1 then 
Xi xor y t = false and diff(A,F) O diff(X',F') follows. 

In addition to simplification rules, we apply decomposition rules to high- 
level constraints. For example, an allDif f constraint decomposes naturally to 
a set of constituent dif f constraints. The rule we apply to decompose allDif f 
constraints is as follows: 

allDiff ([ITi, . . . , U m ]) ^ { diff (U h Uj) 1 1 < i < j < m } , (2) 

permutation^ ([Ui, . . . , U m ]) 

where permutation^ is a redundant constraint. 4 Its role is to introduce redun- 
dant clauses to accelerate SAT solving for the special case when the allDif f 
constraint specifics a permutation (m variables taking m different values). By de- 
laying the special treatment of allDif f constraints which specify permutations 
we can often detect more permutations than prior to constraint simplification. 
The precise specification of the permutation^ constraint is given in Section 4. 

3 Boolean Equi-Propagation 

Let B be a set of Boolean variables. A literal is a Boolean variable b € B or its 
negation -16. The negation of a literal £, denoted is defined as -16 if i = b 
and as b if i = ->b. The Boolean constants 1 and represent true and false, 
respectively. The set of literals is denoted £ and £ ,i = £U {0,1}. 

An assignment, A, is a partial mapping from Boolean variables to constants, 
often viewed as the set of literals: { b | A(b) — 1 } U { — ■& | A(b) = }. For a for- 
mula ip and b £ B, we denote by ip[b] (likewise <p[ _, &]) the formula obtained by 
substituting all occurrences of b e B in ip by true (false). This notation extends 
in the natural way for sets of literals. We say that A satisfies ip if p[A] evaluates 
to true. A Boolean Satisfiability (SAT) problem consists of a Boolean formula p 
and determines if there exists an assignment which satisfies ip. The set of (free) 
Boolean variables that appear in a Boolean formula p is denoted vars(ip). 

4 the symbol # in the name of a constraint indicates that it is redundant. 
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A Boolean equality is a constraint £ = £' where £,£' G Co,i- A (Boolean) 
equi-formula E is a set of Boolean equalities understood as a conjunction. The 
set of equi-formulae is denoted £. 

Equi-propagation is the process of inferring new equational consequences 
from the constraints of a model and existing equational information. An equi- 
propagator for Boolean formula tp is an extensive function fj, v : £ — > £ (namely, 
s.t. fi v (E) —> E) defined s.t. A{ ee£\tpAE\=e} — > fi v (E). That is, a con- 
junction of Boolean equalities, at least as strong as E, made true by tp A E. 
We say that equi-propagator fi v is complete if for all equi-formula E, ti v (E) <R- 
{ e <E £\tp A E \= e} . We denote a complete equi-propagator for tp as jl v . 

Example 1. Let X = (x\, X2, X3, X4) and Y = (yi, y 2 , y 3 , 1/4) and consider E\ = 
{ Vi = 1, V2 = 1, 2/3 = 0, yi = } and E 2 = { x 2 = -^3, x 3 = ^y 2 }. Then, 
Adiff„(x,Y)(£i) = Ei U {.x 2 = x 3 } and also Adiff„(x,y) (E 2 ) = fidm u (x,Y)(Ei). 

Theorem 1. Complete equi-propagation is uniformly stronger than unit propagation. 

Proof. Suppose formula ip |= C where C = [£\ V- • • \/l n ) is a clause. Assume also 
that E G £ is such that |= . . . , E |= -i£„-i- Unit propagation from CAE 
will infer £„. Clearly tp A E |= (£ Y V • • ■ V £ n ) A ^£ x A ■ ■ • A "A-i h 4 = 1 and 
hence {£„ = 1} G jlip(E). Thus complete equi-propagation will infer everything 
inferred by unit propagation for any clausal representation of ip. 

Boolean Unifiers It is convenient to view equi-formula in a generic "solved- 
form" as a substitution, 9e, which is a (most general) unifier for the equations in 
E. Boolean substitutions generalize assignments in that variables can be bound 
also to literals. A Boolean substitution is an idempotent mapping 9 : B — > >Co,i 
such that dom{9) = { b£B | 9(b)^b } is finite and V.6g£>. 9(b) ^ -16. It is viewed 
as the set 9— { b H> 9(b) | b £ dom(9) }. We can apply 9 to another substitution 
0', to obtain substitution (9-9') = { 6 ^ 9(9' (b)) | b G dom(9) U dom(9') }. A 
unifier for equi-formula i? is a substitution 9 such that |= 9(e), for each e G E. 
A most-general unifier for _E is a substitution such that for any unifier 9' of 
E, there exists substitution 7 where 9' = 7 • 9. 

Example 2. Consider the equi-formula E = {b\ = ~^b 2 , -163 = -164, 65 = 65, &6 = 
64, 67 = 1, 6 8 = -i& 7 } then a unifier 9 for i? is {6 2 ^ ~ 64 M> 63, b 5 M> 63, 6 6 n> 
63,67 i-> 1,6 8 n> 0}. Note that 0(£') is the trivially true equi-formula {&! = 

-1&3 = -163, & 3 = 63,63 = 63,1 = 1,0 = -4}. 

Let -< be a total (strict) order on B, extended to an order on £0,1 such that 
0^1 and V.6 G B, 1 -< 6 and 6 « -16. We define a canonical most-general unifier 
for any satisfiable equi-formula E: unify B = A6. min { I G £0.1 | E |= 6 = € }. 
We can compute unif y £ in almost linear (amortized) time using a variation of 
the union-find algorithm [20]. 

Example 3. For the equi-formula E and substitution 9 from Example 2 we have 
that unif y E = 9 where the ordering is -< 1 -< 61 -< b 2 -< ■ ■ ■ -< 6 8 . 
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The following allows us to replace formula tp by unify E (tp), and provides an 
alternative, more efficient to implement, definition for complete cqui-propagation. 



Before presenting the proofs of Propositions 1 and 2 we present two additional 
propositions. First some notation: The projection of variable b from tp is the 
formula (Bb.tp) defined by y?[6] Vy>[-i6]. The projection (BB.tp) for a set of variables 
B C £>, is defined in the natural way. 

Proposition 3. Let tp be a propositional formula and E an equi-formula. Then, 
3dom(unify E ). (E A tp) unify E (tp). 

Proof. Let A be a satisfying assignment for vmify E (ip). Since unify E (ip) does 
not involve variables in dom(xm±t y E ) we can extend A to 



By construction, A' is a satisfying assignment of E A ip because \/£ = £' G E, 
A'(£) = A'(£'), and A'(tp) = A-xmify E (tp) is a tautology, and hence A' \= tp. So, 
A is a satisfying assignment for EAtp. Let A' be a satisfying assignment for EAtp. 
Then A' is a unifier of E. Hence, W = £' G E, A'{£) = A'{£') which implies that 
Vfe h-» I G unify B , A' (6) = A'(£). Let A = { & h-> £ G A' | 6 ^ dom(unif y B ) }. 
Now A(unif y E (tp)) = A'(tp) by construction and -A'(<^) is a tautology. Hence A 
is a satisfying assignment of unif y E (tp). □ 

Proposition 4. Let tp be a formula, and E a set of Boolean equalities, and e a 
Boolean equality tp A E (= e z/f unify E (tp) \= unify E (e). 

Proof. (<=) Suppose unify B (<^) \/= unifyg(e) then there is a satisfying assign- 
ment A of unif y E (tp) where A(unif y E (e)) is false. By Proposition 3, A is also 
a model of 3dom(unif y E ).E A tp and hence can be extended to a solution A' of 
E Atp. But A'(e) = A (unif y E (e)) by definition, hence E A <p ^ e. 
(=>) Suppose tp A E Y= e then there is a satisfying assignment A of tp A E where 
A(e) is false. A \= E Atp then A models unify E (tp). Similarly since A is a model 
of £7 A(e) = A(unif y E (e)), hence unify^^) Y= unify B (e). □ 

Proof (of Proposition 1). Clearly tpAE — > 3dom(unify E ).tpAE <-» unify s (y) by 
Proposition 3 and hence t/sA-B — > unif y E {tp) AE. For the reverse implication, let 
A be a solution of unif y E (tp) AE, then it is a unifier of E and hence A(6) = A(l) 
for each 6 i— > £ G unify^ Then A |= 93 since A \= unify E (tp) and these two only 
differ by replacing b with I where b 1— > Z G unif y B . Hence unif y E (tp)AE — > tpAE. 



Proof (of Proposition 2). Let £" = £? A {e G £ | unify E (tp) ^= e}. Clearly 
fiip(E) — > E and fi v {E) — > e where unify E (tp) — > e since E A tp — > unif y E (tp) 
by Proposition 3. Hence jl v {E) — > E' . Consider e G jl v (E). Then 93 A E —> e. 
Hence unify E (tp) —> unify E (e) by Proposition 4, so unify B (e) G E' Finally 



Proposition 1. 

tp AE <R- unify E (tp)AE 



Proposition 2. 

A^(-E) O EA{e G 5 I unify B (v?) |= e} 



A' = A U { 6 (->• A(unif y E (b)) | 6 G dom(unif y E ) } 



□ 




□ 
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(a) BDD for diff U (A, B) (b) Simpl'd wrt B=[l, 1, 0] (c) Simpl'd wrt A 2 =A 3 

Fig. 1. BDDs for the formula (a) tp = unary 3 (A, [0, 3]) A unary 3 (B, [0, 3]) A diff (A, B) 

(b) unify B (<p) where E = {B\ = l,i?a = l,Bz — 0} and (c) unify E ,(y>) where 
= E(J{A2 = A^}. The full (dashed) line corresponds to the true (false) child. Edges 

to the target "F" are omitted for brevity. 

Implementing complete equi-propagators A complete equi-propagator is 
straightforward to implement using binary decision diagrams (BDDs) . Consider 
Boolean formula tp and cqui-formula E. Then, for equation (l\ — £2), based on 
Proposition 2, we can test the condition, unify £ (y) |= (ii o £2) using a stan- 
dard BDD containment test e.g., "bddLeq" in [18]. This test can be performed for 
all relevant equations involving variables from xmlfy E (tp) (and constants 0,1). 

Example 4- Consider the BDD shown in Figure 3(a) which represents the for- 
mula: tp = unary 3 (A, [0,3]) A unary 3 (B, [0,3]) A diff(A,B). Suppose that E 
is { Bi = 1, B 2 = 1, B 3 = }. The BDD for unify^) is shown in Fig- 
ure 3(b). It is easy to see from the BDD that equi-propagation determines that 
unif y E (tp) h A 2 = A 3 . Indeed jx^E) = E' = E U {A 2 = A 3 }. 

We apply complete equi-propagation in cases when BDDs are guaranteed to 
be polynomial in the size of the constraints they propagate for. The following 
result holds for an arbitrary constraint tp, so it also holds for unif y E (tp). 

Proposition 5. Let c(Xs) be an arbitrary constraint about integer variables 
Xs = [X\ , . . . , Xk] each represented with n bits in the order encoding. Then, the 
number of nodes in the BDD representing c(Xs) is bound by 0(n k ). 

Proof. (Sketch) There are only n + 1 legitimate states for each n bit unary 
variable, and the BDD cannot have more nodes than possible states. □ 
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Implementing adhoc equi-propagators Most simple constraints have a fixed 
small arity and hence complete equi-propagators using BDD are polynomially 
bounded. However, this is not the case for global constraints where the arity 
is not fixed. In this case we can define an adhoc, possibly incomplete, equi- 
propagator. We demonstrate this for the allDif f ([Z7i, . . . , U m ]) constraint where 
each U{ is represented in n bits. 

Example 5. Consider Us = [U±, . . . , U5] where the U, = (xu, . . . , 2^9} are integer 
variables in the range [0,9]. Given E, we denote unif y B (/7s)=[?7{, . . . , U5] and 
illustrate equi-propagator fJ,tp(E) = E U E' for <£>=allDif f u (Us): (1) Consider 
E = {a;i 2 =l,a;i3=0}. Denoting E a = { x y = 3 < 2 | 1 < j < 9 } (e.g. U[ = 2), 
and Eb = { x i2 ~ Xis\2 < i < 5 } (e.g. U[ ^ 2 for i > 1), the propagator adds 
E' = E a U E b . (2) Consider E = E b U E c where E c = { x i5 = 1 1 < % < 5 } 
(e.g. U < 4) and £{, is from the previous case. In this case, only U\ can take 
the value 2. So a propagator adds equations imposing that U[ = (1, 1, 0, . . . , 0). 
(3) Consider E = E c U Ed where Ed = [j { xu = 2^2,^13 = %a | 3 < i < 5 } 
(e.g. only U\ and U2 can take the values 1 and 3) and E c is from the previous 
case. A propagator adds E' = U { Xu = 1, x i2 = a;,3, ij4 = ie {1, 2} }. 

The following is essentially the usual domain consistent propagator for the 
allDif f constraint [17] applied to the unary encoding. 

Definition 1 (ad-hoc equi-propagator for allDif f). An equi-propagator 
for ip = allDif f u (Us) where Us = [Ui , . . . , U m ] is defined as fJ, v {E) = E U 
E' where E' = { U-(v) = U'^v + 1) | % e {1, . . . , m} - H, v e V } if there exists 
a Hall set H C {l,...,m} where V = Ui e Hdom(Ul), \V\ = \H\ and denoting 
waify E (Us) = [U{,..., U' m \. Otherwise, E' = 0. 

After a Hall set H is detected (and equi-propagation has triggered) , we also 
apply an additional decomposition rule: 

allDif f (Us) i-> allDif f ([U i £ ff]) A allDif f ([Ui | i € {1, . . . , m} - H]) 

The benefit arises because the first allDif f constraint is guaranteed to represent 
a permutation which then benefits from permutation^. 

For the three cases in Example 5 we have: (1) H={1} and T^={2}, (2) 
H={2, 3, 4, 5}, and V={0, 1, 3, 4}, and (3) H={3, 4, 5} and V"={0, 2, 4}. Indeed 
we can convert any finite domain propagator to an equi-propagator. The follow- 
ing holds simply because the unary encoding can represent arbitrary domains. 

Proposition 6. Let E G £ and c(Xs) be a constraint over integer variables 
Xs = [Xi, . . . ,X m j. Let unif j E (Xs) = [X[, . . . , X' m ]. Suppose D is the mapping 
from variables to sets of value D(Xi) = dom{X' i ) and suppose propagator f for 
c(Xs) maps D to D' . Then a correct equi-propagator for c(Xs) discovers new 
equality literals E' = {X[{v) = X[{v + 1) | i G {1, . . . , m}, v € D{Xi) - D'(X t )}. 

Note that complete equi-propagators can determine more information than 
finite domain propagation as illustrated by the example for E 2 in Example 1. To 
complete this section, consider the following example. 
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Fig. 2. An example of equi-propagation with E\ to specify that X\ G {1,4}, X2 G 
{1, 4}, X 3 6 {0, 1, 3}, X 4 G {0, 3, 4}, and X 5 G {0, 1, 2, 3, 4}. 

Example 6. Consider a constraint allDiff(Xs) where Xs = [X\, . . . ,X§\ with 
each Xi in the interval [0,4], depicted as Fig. 2(a). Consider also the equi- 
formula E 1 which specifies that X±,X 2 G {1,4}, X 3 G {0,1,3}, X 4 G {0,3,4}. 
Fig. 2(b) depicts unify B (Xs). Constraint simplification proceeds in two steps. 
First, equi-propagation adds equi- formula E2, the affect of which is depicted 
as Fig. 2(c) where the constraint is also decomposed to constraints: The upper 
part, allDif f (X\, X2), and the lower part allDif f (X3, X4, X5). In the second 
step equi-propagation adds equi- formula E3 , the impact of which is depicted as 
Fig. 2(d). The original constraint is now fully solved, Xs is represented using 
only 2 propositional variables and the CNF encoding will contain no clauses. 

4 Optimized SAT encodings using Equi-propagation 

Boolean equi-propagation is at the foundation of our optimizing CNF compiler. 
The compiler repeatedly applies: equi-propagation, constraint decomposition, 
restriction and elimination, and finally outputs CNF encodings. We assume that 
each constraint comes with an associated equi-propagator. 

Given a conjunction <P of constraints, we first apply cqui-propagators. Each 
such application effectively removes at least one bit from the Boolean repre- 
sentation of <P. During this process, when no further equi-propagators can be 
applied, we may apply a decomposition rule to a high-level constraint, introduc- 
ing additional low-level constraints, but without introducing additional bits in 
the system. The actual implementation is of course less naive. It takes care to 
wakeup equi-propagators only when they may generate new information, and 
it makes use of the most efficient implementation of the equi-propagator pos- 
sible, so avoiding BDD based propagators if we have an equivalent propagator 
implemented directly. 

The complexity of the compiler is measured in the size of the constraint sys- 
tem <P it is optimizing. Denote by |#| c , the total number of low-level constraints 
in $ after decomposing all high-level constraints, and by \<P\b the total number of 
Boolean variables in the bit representation of <P. Assuming that equi-propagators 
are of polynomial cost, then so is the cost of running the compiler itself. 



!) 



Proposition 7. Let <P be a conjunction of high- and low-level (finite domain) 
constraints. Then the number of equi-propagation steps performed when compil- 
ing <P is bound by 0(\<P\ C x |<Z>| b ). 

Proof, (sketch) Each pass of the algorithm covers at most \<P\ C constraints and 
removes at least one of the |^|b Boolean variables from <P. □ 

After equi-propagation and constraint decomposition triggers no more, we 
apply constraint restriction and elimination rules. We say that a constraint ip is 
redundant with respect to an equi-formula E if either (a) unify E (ip) is a tau- 
tology or (b) there exists another constraint ip' in the constraint store such that 
unify^y/) ^ unify £ (iy9). Our implementation is tuned to identify a collection 
of adhoc cases. However in general, where BDDs have been applied to implement 
complete propagators, such tests are particularly easy. Testing for (a) is trivial. 
Testing for (b) is also straightforward for BDDs e.g., using "bddLeq" in [18]. 
However we only apply this rule in a restricted form due to the quadratic time- 
complexity of examining all pairs of constraints. Namely, to determine cases of 
the form true u {X\ 1 . . . , X n ) \= c(X\, . . . , X n ) where the constraint is redundant 
with respect to the unary encoding of its variables. 

Example 7. Take tp = dif f U (A, B) and E' from Example 4. The BDD for ip" = 
unifyg, (ip) is shown in Figure 3(c). One can check that unary(A, [0,4]) |= ip" 
using "bddLeq" indicating that the original constraint dif f (A, B) is redundant. 

In the final stage, when no further simplification applies, constraints are 
encoded to CNF formula. This can be performed either using their Boolean 
specification, or if BDD based propagators were applied, then we can read off 
the encoding from the BDD using standard techniques. 

Redundant constraints (with subscript # in the name) that were introduced 
in the model only to improve equi-propagation need not be encoded to CNF 
clauses. However, when we expect such redundant clauses to facilitate unit 
propagation during SAT solving, then we do add them. For instance, we add 
clauses to encode redundant permutation^ constraints. Each such constraint 
ip' = permutation_^([C/i, . . . , U m ]) is affiliated with a corresponding allDiff 
constraint. If S = U™ 1 dom([/i), |5| ^ m then the allDiff constraint does not 
represent a permutation and nothing is added. Otherwise we create additional 
Boolean variables bi V to represent the expressions Ui — v,v £ S. Let the unary 
encoding of Ui be (ui, . . . ,Ufe). We add clauses encoding bi V O (u v A -iu„_|_i) 
to connect these to the unary encoding, and the clauses V^-l&^Vu £ S to get 
better propagation from permutations. 

5 Implementation, Experiments, and Extensions 

All experiments were performed on an Intel Core 2 Duo E8400 3.00GHz CPU 
with 4GB memory under Linux (Ubuntu lucid, kernel 2.6.32-24-generic). 5 Our 

5 The benchmark instances and encodings can be viewed at 
http: //www . cs .bgu. ac . il/~ amitmet/csp2011/. 
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prototype constraint compiler is written in Prolog and run using SWI Prolog 
v5.10.2 64-bits. Complete propagators are implemented using the BDD package, 
CUDD v2.4.2. Comparisons with Sugar (vl.14.7) are based on the use of identical 
constraint models, apply the same SAT solver (CryptoMiniSat v2.5.1), and run 
on the same machine. Comparisons with Minion (vO.10) are based on the use of 
identical constraint models, and run on the same machine (with minor differences 
due to syntax). For each of the example problems we extend (our description of) 
the constraint modelling language as required for the benchmarks. 



instance 


compiler 


Sugar 


3D 


CSP'08 


OSC'09 


FS'09 


num 


un/sat 


compl 


enf size 


SAT 


enf size 


SAT 


enf size 


SAT 












(sec.) 


(clauses) 


'sec.) 


(clauses) 


(sec.) 


(clauses) 


sec.) 


(sec.) 


(sec.) 


(sec.) 


1 


sat 


0.41 


6509 


2.46 


140315 


37.36 


6507 


0.09 


31.55 


34.81 


6.44 


2 


sat 


0.33 


7475 


0.02 


140920 234.70 


7438 


0.74 


137.60 


99.84 


44.80 


3 


sat 


0.38 


6531 


0.02 


140714 


17.02 


6512 


0.02 


10.57 


12.25 


2.53 


4 


sat 


0.38 


6818 


0.61 


141581 


90.64 


6811 


0.08 


47.24 


273.36 


157.58 


5 


sat 


0.35 


7082 


0.32 


140431 


206.03 


7099 


0.14 


27.33 


24.87 


22.30 


6 


sat 


0.33 


7055 


0.45 


140625 


67.84 


7044 


1.11 


35.78 


108.60 


12.58 


7 


sat 


0.33 


7711 


2.36 


142200 


60.97 


7684 


0.08 


57.23 


67.32 


341.62 


8 


sat 


0.35 


7426 


0.05 


140784 


34.43 


7367 


0.04 


43.88 


1.52 


6.08 


9 


sat 


0.37 


6602 


0.28 


137589 


33.76 


6609 


0.41 


25.15 


9.52 


3.01 


10 


sat 


0.36 


6784 


0.17 


142303 


50.86 


6799 


0.06 


26.16 


27.80 


12.66 


11 


unsat 


0.45 


6491 


0.05 


140603 


39.02 


6534 


0.03 


19.47 


30.92 


5.30 


12 


unsat 


0.23 


1 


0.00 


139037 


0.58 


7393 


0.00 


0.36 


0.05 


0.81 


13 


unsat 


0.28 


1 


0.00 


141295 


0.90 


6555 


0.00 


1.47 


0.16 


0.80 


14 


unsat 


0.28 


1 


0.00 


140706 


2.25 


7173 


0.00 


1.40 


0.29 


0.80 


15 


unsat 


0.38 


6063 


0.05 


140224 


35.93 


6104 


0.06 


32.39 


58.41 


4.77 



Table 1. QCP results for 25 x 25 instances with 264 holes 



Quasigroup Completion Problems (QCP) are given as an n x n board of 
integer variables (in the range [l,n]) in which some are assigned integer values. 
The task is to complete the board, assigning values to all variables, so that no 
column or row contains the same value twice. A model for a QCP instance is a 
conjunction of allDif f constraints corresponding to the variables (and values) 
in its rows and columns. 

Table 1 illustrates results for 15 (of the largest) instances from the 2008 CSP 
competition 6 with data for our compiler (compilation time, number of clauses, 
SAT solving time), Sugar (number of clauses, subsequent SAT solving time), the 
so-called 3D SAT encoding of [12] (number of clauses after unit propagation, SAT 
solving time), and from: CSP'08 (the winning result from the 2008 competition), 
OSC'09 and FS'09 (results for lazy clause generation solvers reported in [15] and 
[9]). It is, by now, accepted that the 3D encoding is strong for QCP problems, a 
fact echoed by the results of Table 1. Observe that for 3 instances, unsatishablc 
is detected directly by the compiler (where the CNF contains 1 empty clause). 

Table 2 illustrates results for larger (40 x 40, satishable) instances 7 with 800- 
1000 holes. We compare the order-encoding (compiled) and the 3D-encoding 
(with unit propagation). The CNF sizes before compilation/unit propagation are 

6 http: //www. cril .univ-artois . f r/CPAI08/ 

7 Generated using lsencode from http://www.cs.cornell.edu/gomes/SOFT. 
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inst. 

800 

holes 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



order cnc. 



CNF" 

mCl 



(sec) 



0.11 
0.11 
0.11 
0.11 
0.11 
0.11 
0.11 
0.11 
0.11 
0.11 



18.24 
2.88 
6.54 
0.34 

21.50 
0.68 

13.79 

25.16 
9.46 
4.59 



3D 
CNF 



enc. 



mCl (sec) 



0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 



6.87 
3.70 
2.50 
1.47 
7.09 
1.78 
2.75 
0.48 
12.92 
1.43 



inst . 
1000 

holes 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 



order 



CNF" 

mCl 



cnc. 

(sec) 



0.31 
0.31 
0.31 
0.31 
0.31 
0.30 
0.30 
0.32 
0.31 
0.30 



0.40 
0.39 
0.39 
0.39 
0.39 
0.39 
9.22 
0.39 
0.38 
0.65 



3D cnc. 
CNF SAT 

mCl (sec) 



0.38 27.78 
0.38 0.33 
0.39 19.76 



0.38 
0.38 
0.37 
0.37 
0.39 
0.38 
0.37 



8.73 
0.35 
3.13 
0.34 
0.36 
4.21 
8.26 



Table 2. QCP 40 x 40. CNF size in million's clauses 



circa 2.74 million clauses for the order-encoding and 3.74 for the 3D-encoding. 
The advantage of the 3D encoding is no longer clear. 

Nonogram Problems are expressed as a board of cells to color black or white, 
given clues per row and column of the board. A clue is a number sequence 
indicating blocks of cells to be colored black. For example, the clue (4, 8, 3) on a 
row (or column) indicates that it should contain contiguous blocks of 4, 8 and 3 
black cells (in that order) and separated by non-empty sequences of white cells. 

A Nonogram puzzle is modeled as a Boolean matrix with constraints per row 
and column, each about a clue (sequence of numbers) {b\, . . . , 6^), and about a 
Boolean vector, Vec (a row or col- 
umn of the matrix). Each number 
bi is associated with an integer vari- 
able indicating the index in Vec 
where block bi starts. For notation, if 
U = (ui, . . . , u n ) is an integer vari- 
able (order-encoding) then U +c is the 
vector with c ones prefixing the bits 
of U and represents the value U + c. 
Similarly, if U is greater than c then 
U~ c = (itc+i, . . • , u n ) represents the 
value U — c. We introduce two additional constraints 

[¥] bloc^(U ll U 2l Vec) [I] leq(E/i, U 2 ) 

The first specifies that for a bit vector Vec the variables in the indices greater 
than value U\ and less equal value U2 (with U\ < U2) are true. The second 
specifics that for integer variables U\ and U2 in the order-encoding, Ui < [/ 2 . 
The Boolean functions corresponding to constraints of these forms are as follows: 

block(£/i,f7 2 ,(a;i,...,a;„)) = ALi ("^i (*) A U 2 (i) -> Xi , . 

leq((a:i,...,a;„),<j/i,...,j/n» = A7=i x i ~> V* 

Example 8. The constraints below model the position of block sequence s = 
(3,1,2) in X — (xi, . . . ,xg). In the first column, integer variables, Ui,U2,Uz 
model the start positions of the three blocks. In the second column, the start 
position of a block is required to be at least one after the end position of its prede- 



instance 


compiler 






id size 


comp 


cnf 


sat 


BGU 


Walt. 


9717 (30x30) 


0.13 


14496 


124.43 


00 


00 


10000 (50x40) 


0.28 


44336 


40.66 


00 


00 


9892 (40x50) 


0.57 


30980 


0.44 


00 


00 


2556 (45x65) 


0.13 


2870 


0.00 


15.85 


0.4 


10088 (63x52) 


0.64 


78482 


1.26 


0.27 


0.08 


2712 (47x47) 


0.31 


43350 


0.92 


5.98 


4.95 


8478 (50x50) 


0.40 


51027 


0.95 


0.89 


00 


6727 (80x80) 


1.11 


156138 


2.86 


0.5 


0.17 


8098 (19x19) 


0.02 


3296 


0.06 


209.54 


8.63 


6574 (25x25) 


0.10 


7426 


0.03 


37.56 


2.94 



Table 3. Human Nonograms Results 
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cessor. In the third column, block constraints specify the black cells in the vector 
X, and in the fourth column the white cells in the block X = (—ix\ t . . . , ~^xq). 

unary(t/i,[l,9]) block^f 1 , U? 3 ,X) block(0, U u X) _ 

unary(f7 2 ,[l,9]) leq(U+ 4 ,U 2 ) block^" 1 , U} 1 , X) block(t/+ 4 , U 2 , X) 
unary(t/ 3 ,[l,9]) leq(£/+ 2 , C/ 3 ) block^" 1 , [7 3 + 2 , X) block({/ 2 + 2 , U 3 , X) 

Tables 3 & 4 compare ours to the two fastest documented Nonogram solvers: 
BGU (vl.0.2) [16] and Wolter (vl.09) [23 
Table 3 is about "human-designed" in- 



time (sec) 


0.20 0.50 1.00 10.00 30.00 60.00 


BGU 
Wolter 
Compiler 


279 3161 4871 4978 4989 4995 
4635 4782 4840 4952 4974 4976 
13 4878 4994 5000 5000 5000 



Table 4. 5,000 Random Nonograms Results 



stances from [21]. These are the 10 
hardest problems for the BGU solver. 
The first 8 puzzles have at least 2 solu- 
tions. The last 2 have a single solution. 
Solving time is for determining the number of solutions (0, 1, or more). For our 
compiler, the columns indicate: compilation time, enf size (number of clauses) 
and sat solving time. The final two columns are about the solution times for 
the BGU and Wolter solvers (running on the same machine). The timeout for 
these solvers (indicated by oo) is 300 sec. Table 4 reports on a collection of 5,000 
random puzzles from [22]. For each of the three solvers we indicate how many 
puzzles it solves within the given allocated time. 

BIBD Problems (CSPlib problem 28) arc defined by a 5-tuple of positive 
integers {v, b, r, k, A) and require to partition v distinct objects into b blocks 
such that each block contains k different objects, exactly r objects occur in each 
block, and every two distinct objects occur in exactly A blocks. To model BIBD 
problems we introduce three additional constraints 

sumBits([Si,...,B„],[/) \T\ uadder([/i,[/ 2 ,[/3) 

pairwise_and([Ai, . . . , A n ), [Bi, . . . , B n ), [Ci, . . . , C n }) 



The first (high-level) constraint states that the sum of bits, [B\, . . . , B n ] is the 
unary value U. It is defined by decomposition: split the bits into two parts, sum 
the parts, then add the resulting (unary) numbers. The sum of two unary num- 
bers, U1 + U2 = U3, is specified by the (low-level) constraint uadder([/i, U2, U3). 
To compute the scalar product of vectors [A\, . . . , A n ] and [Bi, . . . , B n ] we use 
the pairwise_anc? constraint in combination with sumBits. 

The model for BIBD instance (v,b,r,k, A) is a Boolean incidence matrix 
with constraints: sumBits(C, k) for each column C; sumBits(i?, r) for each row 
R; and for each pair of rows Rj (i < j), pairwise_a?i(i(i?i, R 3 ■, Vs) and 
sumBits(Us, A). To break symmetry, we can reorder rows and columns of the 
matrix to assign fixed values in the first two rows and leftmost column: the first 
row contains r ones, followed by zeros. The second row contains A ones, r — A 
zeros, r — A ones, and then zeros. The left column contains k ones followed by 
zeros. This is the information that enables the compiler to simplify constraints. 

Table 5 shows results comparing our compiler using the model we call SymB 
for symmetry breaking (compilation time, enf size, and sat solving time) with the 
Minion constraint solver [11]. Ignore for now the last 3 columns about SatELite. 
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instance 


compiler (SymB) 


Minion 


SatELite (SymB) 


(ik 6, r, fe, A) 


comp 


enf size 


SAT 


[M'06] 


SymB 


SymB + 


prcpro 


enf size 


SAT 




(sec.) 


(clauses) 


(sec.) 


(sec.) 


(sec.) 


(sec.) 


(sec.) (clauses) 


[sec.) 


(7,350,150,3,50) 


1.34 


494131 


1.23 


0.47 


1.12 


0.38 


1.27 


566191 


1.65 


(7,420, 180,3,60) 


1.65 


698579 


1.73 


0.54 


1.36 


0.42 


1.67 


802576 


2.18 


(7, 560, 240, 3, 80) 


3.73 


1211941 


13.60 


0.66 


1.77 


0.52 


2.73 


1397188 


5.18 


(8, 84, 42, 4, 18) 


0.25 


64432 


0.17 


2.41 


oo 


0.70 


0.31 


73780 


0.15 


(8,98,49,4,21) 


0.33 


84993 


0.23 


5.63 


CO 


1.54 


0.34 


97588 


0.33 


(12,132,33,3,6) 


0.95 


180238 


0.73 


5.51 


oo 


1.76 


1.18 


184764 


0.57 


(13,26,8,4,2) 


0.12 


17570 


0.05 


5.46 


0.47 


0.16 


0.22 


17391 


0.10 


(15, 45, 24, 8, 12) 


0.51 


116016 


8.46 


CO 


CO 


75.87 


0.64 


134146 


CO 


(15,70,14.3.2) 


0.56 


81563 


0.39 


12.22 


1.42 


0.31 


1.02 


79542 


0.20 


(16,80, 15.3.2) 


0.81 


109442 


0.56 


107.43 


13.40 


0.35 


1.14 


105242 


0.35 


(19, 19,9,9,4) 


0.23 


39931 


0.09 


53.23 


38.30 


0.31 


0.4 


44714 


0.09 


(19,57,9,3, 1) 


0.34 


113053 


0.17 


oo 


1.71 


0.35 


10.45 


111869 


0.14 


(21, 21, 5, 5, 1) 


0.02 





0.00 


1.26 


0.67 


0.15 


0.01 





0.00 


(25,25,9,9,3) 


0.64 


92059 


1.33 


oo 


CO 


0.92 


1.01 


97623 


8.93 


(25,30,6,5,1) 


0.10 


24594 


0.06 


oo 


1.37 


0.31 


1.2 


23828 


0.05 


(31,31,6,6,1) 


0.08 


8571 


0.03 


oo 


2.10 


0.36 


0.28 


8001 


0.03 


Total 


40.53 


84.40 


> 223.82 



Table 5. BIBD results (180 sec. timeout) 



We will come back to explain these in Section 6. All experiments were run on the 
same computer. We consider three different models for Minion: [M'06] indicates 
results using the BIBD model described in [11], SymB uses the same model we 
use for the SAT approach, SymB + , is an enhanced symmetry breaking model 
with all of the tricks applied also in the [M'06] model. For the columns with 
no timeouts we show total times (for the compiler this includes compile time 
and sat solving). Note that by using a clever modeling of the problem we have 
improved also the previous runtimes for Minion. 

Word Design for DNA (Problem 033 of CSPLib) seeks the largest pa- 
rameter n, s.t. there exist a set S of n eight-letter words over the alphabet 
£ = {A, C, G, T} with the following properties: (1) Each word in S has 4 sym- 
bols from {C, G}; (2) Each pair of distinct words in S differ in at least 4 positions; 
and (3) For every x,y £ S: x R (the reverse of x) and y c (the word obtained by 
replacing each A by T, each C by G, and vice versa) differ in at least 4 positions. 

In [10], the authors present the "template-map" strategy for this problem. 
Letters are modelled by pairs (ti, mi) of bits. For each eight-letter word, (ti, . . . , t$) 
is the template and (mi, . . . , ms) is the map. The authors pose conditions on a set 
of templates T and a set of maps M so that the cartesion product S = TxM will 
satisfy the requirements of the original problem. It is this template-map strategy 
that we model in our encoding. The authors report a solution composed from 
two template-maps (T^Mi) and (T 2 ,M 2 ) where |Ti| = 6, \M±\ = 16, |T 2 | = 2, 
|M 2 | = 6. This forms a set S with (6 x 16) + (2 x 6) = 108 DNA words. Marc 
van Dongen reports a larger solution with 112 words. 8 To model this problem 
we introduce the two constraints (where Vi are vectors of bits). 



lexleq([Vi,...,V r „]) 10 lexleq(Fi, V 2 ) 



See http : //www. cs . st-andrews . ac .uk/~ ianm/CSPLib/. 
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The first specifies that a list of vectors is ordered in the lexicographic order. It 
decomposes to the low-level constraint (the second) that specifies that a pair of 
vectors is ordered in the lexicographic order. 

Using our compiler, we find a template and a map of sizes 14 and 8, the 
cartesian product of which gives a solution of size 14 x 8 = 112 words. The 
SAT solving time is less than 0.2 seconds. To show that there is no templete 
of size 15 and no map of size 9 takes 0.14 and 3.32 seconds respectively. This 
is a new result not obtainable using previous solving techniques. We obtain 
this result when symmetries are broken by ordering the vectors in T and in M 
lexicographically. Proving that there is no solution to the original DNA word 
problem with more than 112 words (not via the template-map strategy) is still 
an open problem. 

6 Related Work and Conclusion 

There is a considerable body of work on CNF simplification techniques with a 
clear trade-off between amount of reduction achieved and invested time. Most 
of these approaches determine binary clauses implied by the CNF, which is cer- 
tainly enough to determine Boolean equalities. The problem is that determining 
all binary clauses implied by the CNF is prohibitive when the SAT model may 
involve many thousands of variables. Typically only some of the implied binary 
clauses are determined, such as those visible by unit propagation. The trade- 
off is regulated by the choice of the techniques applied to infer binary clauses, 
considering the power and cost. See for example [7] and the references therein. 

In our approach, the beast is tamed by introducing a notion of locality. We do 
not consider the full CNF. Instead, by maintaining the original representation, 
a conjunction of constraints, each viewed as a Boolean formula, we can apply 
powerful reasoning techniques to separate parts of the model and maintain effi- 
cient preprocessing. Our specific choice, using BDD's for bounded sized formula, 
guarantees that reasoning is always polynomial in cost. 

To illustrate one difference consider again Example 6 where cqui-propagation 
simplifies the constraint so that it is expressed in 2 propositional variables and 
requires clauses. In contrast, the CNF representing the allDiff constraint 
with the initial equations E\ consists of 76 clauses with 23 variables and after 
applying SatELite [7] this is reduced to 57 clauses with 16 variables. Examining 
this reduced CNF reveals that it contains binary clauses corresponding to the 
equations in E2 but not those from £3. 

Finally, we come back to (the last 3 columns in) Table 5 where a comparison 
with SatELite is presented. It is interesting to note that in some cases prepro- 
cessing results in smaller CNF and faster SAT solving, however in total (even if 
not counting the timeout for BIBD instance (15, 45, 24, 8, 12)) equi-propagation 
is stronger. 

Using equi-propagation on a high level view of the problem allows us to 
simplify the problem more aggressively than is possible with a CNF representa- 
tion. The resulting CNF models can be significantly smaller than those result- 
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ing from straight translation, and significantly faster to solve. Hence we believe 
that Boolean equi-propagation, combined with CNF simplification tools (such as 
SatELite), makes an important contribution to the encoding of CSPs to SAT. 
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